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the amount of a nucleic acid by primer-directed ligation. 
Another object of the present invention is to provide meth- 
ods for the detection of genes and the analysis of gene 
expression. Yet another object of the present invention is to 
provide methods for the detection of genetic mutations. Still 
another object of the present invention is to provide methods 
for the analysis of the base sequence of a nucleic acid. 

GENERAL DESCRIPTION 

It has been discovered that a series of short 
oligonucleotide-5*-phosphates can be simultaneously ligated 
onto a template-bound primer in a contiguous manner to 
produce the complementary strand of a template polynucle- 
otide or nucleic acid. The nucleic acid produced can be 
either labeled or unlabeled by using either labeled or unla- 
beled short oligomers. The oligomers in the set each pref- 
erably contain the same mmiber of bases. When a sequence 
to be synthesized is known exactly, a set containing the 
minimum number of oligomers can be used. The oligomers 
are ligated in the correct order starting from the primer, to 
produce the correct sequence. Primer-independent Ugation 
does not occur when using oligonucleotides of lengths 5 
bases. When the sequence to be synthesized is not known, a 
library of a large number of the total possible pool of 
oHgomers is used. The latter situation occurs in sequence 
analysis and mutation screening. Ligations are preferably 
conducted by means of a hgase enzyme. Known chemical 
agents for ligating nucleotides and oligonucleotides can be 
employed as well. 

BRIEF DESCRIPTION OF THE DRAWINGS 

FIG. 1 schematically depicts the ligation of two oligomers 
PI and P2 onto a template-bound primer in the presence of 
a competing nonligatable complementary oHgomer P3. 

FIG 2 schematically depicts a method for amplifying the 
amoimt of a nucleic acid using template -bound primer 
directed ligation of multiple oligomers. Synthesis is shown 
occurring in one direction in each strand, but can also be 
accomplished bidirectionally as described below. 

FIG. 3 schematically depicts a method for detecting a 
point mutation in a gene by the ligation of detectably labeled 
mutation specific oligomers onto a template-bound primer. 

FIG. 4 schematically depicts a method for detecting two 
different genotypes of a mutation in a gene by the ligation of 
different sets of detectably labeled mutation specific or 
wild-type specific oligomers onto a template-bound primer. 
The mutation specific oligomers bear a first label while the 
wild-type speciiac oligomers bear a second label. 

FIG. 5 depicts an example of branched DNA or amplifi- 
cation multimers. 

FIG. 6 depicts an adaptation of branched DNA in which 
at least the finst branch is prepared by ligation of labeled 
oligomers for providing branch points at regularly spaced 
intervals. 

FIG. 7 schematically depicts a method for determining the 
sequence of a nucleic acid by the ligation of unique labeled 
oHgomers onto a template-bound primer, cleaving the labels 
and analysis of the mass of each unique label. 

DETAILED DESCRIPHON OF THE 
INVENTION 

Definitions 

Ohgomer, ohgonucleotide — as used herein will refer to a 
compound containing a phosphodiester internucleotide link- 
age and a 5'-terminal monophosphate group. The nucle- 
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otides can be the normally occurring ribonucleotides A, C, 
G, and U or deoxyribonucleotides, dA, dC, dO and dT. 

Primer or probe/primer — ^refers to an oligonucleotide 
used to direct the site of ligation and is required to initiate 
5 the ligation process. Primers are of a length sufEicient to 
hybridize stably to the template and represent a unique 
sequence in the template. Primers will usually be about 
15-30 bases in length although longer primers can be used. 
Labeled primers containing detectable labels or labels which 
allow solid phase capture are within the scope of the term as 
used herein. Primer also contemplates contiguously stacked 
oligomers of at least six bases as is known in the art (T. 
Kaczorowski and W. Szybalski, Gene, 179, 189-193 
(1996)). 

Template, test polynucleotide, target are used inter- 
changeably and refer to the nucleic acid whose length is to 

be replicated. 

Sample — A fluid containing or suspected of containing 
one or more analytes to be assayed. Typical samples which 
are analyzed by th& chemiluminescent reaction method are 
20 biological samples including body fluids such as blood, 
plasma, serum, urine, semen, saliva, cell lysates, tissue 
extracts and the like. Other types of samples include food 
samples and environmental samples such as soil or water. 

Short oligonucleotide — As used herein, a ohgonucleotide 
25 5 -phosphate of at lest two and up to about 10 base length. 
The bases can be ribonucleotides or deoxyribonucleotides or 
analogs thereof. The length of a short ohgonucleotide useful 
in a given context can vary within this range and may be less 
than the whole range. The preferred length varies depending 
on the particular application. 

Specific binding pair — Two substances which exhibit a 
mutual binding afBnity. Examples include antigen-antibody, 
hapten-antibody or antibody-antibody pairs, complementary 
oligonucleotides or polynucleotides, avidin-biotin, 
streptavidin-biotin, hormone-receptor, lectin-carbohydrate, 
IgG-protein A, nucleic acid-nucleic add binding protein and 
nucleic add-anti-nucleic acid antibody and metal complex- 
ligand. 

One object of the invention therefore is method for 
^ synthesizing a strand of a nucleic acid complementary to at 
least a portion of a target single stranded nucleic acid 
template comprising: 

a) providing a primer which is complementary to a 
portion of the target single stranded nucleic acid tem- 

45 plate; 

b) hybridizing the primer with the template to form a 
primer-template hybrid having a single stranded region 
and a double stranded region; 

c) contacting the primer-template hybrid with a plurality 
50 of oligonucleotide 5'-monophosphates; 

d) ligating to the primer-template hybrid in sequence at 
least some of the plurality of oligonucleotide 
5'-monophosphates to extend the double stranded 
region and thereby synthesize a nucleic add strand 

55 which is complementary to the portion of the template. 
A preferred method of ligation uses a Hgase such as a 
DNA ligase. Representative ligases include T4 hgase, T7 
Hgase, Tth Hgase, Taq Hgase and E. coli DNA ligase. The 
Hgase can be a thermostable Hgase, in which case thermal 

60 cycHng techniques as discussed below are possible. Thermal 
cycling with a thermostable ligase is useful in methods of 
amplifying nucleic acids in a manner analogous to the 
polymerase chain reaction, but using oHgomers and a Hgase 
in place of dNTPs and a polymerase. Methods of performing 

65 enzymatic ligation reactions are generaUy described in e.g., 
Sambrook, et al.. Molecular Cloning: A Laboratory Manual, 
2nd Ed., Cold Spring Harbor Laboratory, New York, 1989. 
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Enzymatic ligation reactions are generally performed in a 
buffer solution, optionally in the presence of additives to 
promote hybridization. The buffer has a pH typically in the 
range of 6-9, more usually 7-8.5 and preferably in the range 
7.5--8. Buffers capable of maintaining a pH in this range are 5 
suitable. The reaction can be performed over a range of 
temperatures in the range of 0 to about 50° C. Optimal 
temperatures will vary over the range depending on the 
nature and size of oligonucleotide phosphates to be ligated, 
the enzyme, presence and amount of additive and can be lo 
optimized empirically with reference to the general literature 
on ligases and by reference to the specific examples below. 
The length of time for performing the ligation can be as short 
as a few minutes up to several hours, although it is desirable 
to conduct the reaction as rapidly as possible. Single 15 
stranded DNA binding proteins can be added to ohgonucle- 
otide ligation reactions to improve their efficiency. Their 
effect is due to their relaxation of any secondary structure 
that is in the template strand thus allowing the complemen- 
tary oligonucleotides to bind and ligate. E. coli single 20 
stranded binding protein (Promega, Madison, Wis. or 
Amersham/USB) and T4 Gene 32 protein (Boehringer 
Mannheim, Indianapolis, Ind.) can be used. The use of 
volume excluding agents such as polyethylene glycols 
(PEG) may be advantageous in promoting ligations. Inclu- 25 
sion of up to 200 mM NaQ may also be useful for promotiag 
ligations. The use of other additives in enzymatic ligations 
is contemplated and is within the scope of the present 
methods. Additives include phosphate transfer agents such 
as ATP, sulfhydryl reagents, including DTT and 30 
2-mercaptoethanol, and divalent cations such as Mg*^ salts. 

Ligation of oligomer 5'-phosphates also comprehends 
nonenzymatic methods of ligation as well. Chemical 
reagents which effect the formation of the phosphodiester 
internucleotide bond are known (CNBr: K. D. James, A. D, 35 
Ellington, Chemistry & Biology, 4,595,605, (1997); 
N-cyanoimidazole: T Li, K. C. Nicalaou, Nature, 369, 
218-221 (1994); EDAC: D. Sievers, G. Von Kiedrowski, 
Nature, 369, 221-224 (1994)). Chemical ligation methods 
have not been applied to methods of sequence analysis. 40 

Incorporation of mismatched oligomers can occur as in 
other techniques, especially when the sequence has a high 
G-C content. The occurrence of mismatches is controllable 
as is the case with other hybridization methods. 
Temperature, salt concentration, and additives can all be 45 
employed in art-recognized manners to control the strin- 
gency of the hybridization process. Since the effect of a 
mismatch on a small oHgomer should be proportionately 
greater than on a larger one, discrimination of improper 
sequences may show improvement over other ligation tech- 50 
niques. 

Another embodiment uses a library of possible sequences 
to achieve the ligation of a series of short oligomers of 
length n bases to synthesize a complementary nucleic acid. 
The library contains many more possible combinations of 55 
the n bases (n-mers) than are required to form the product 
nucleic add. When n«5, for example, there are 4^ or 1024 
possible 5-mers which contain the four naturally occurring 
bases A, C, G, T and U. Hie library can contain all 4** 
possible oligomers or less than the fuU set, but should 60 
contain at least a substantial proportion (>50%, and 
preferably>75%, most preferably>90%) of the possible oli- 
gomers. 

Known methods of synthesizing polynucleotides, by 
polymerase extensions with dntps or ligation of preformed 65 
oligonucleotides, function by providing only a small number 
of different reactants for incorporation into the product 
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molecule. Known ligation-based methods usually preselect 
the one oligonucleotide with the correct sequence. Poly- 
merase extension methods supply the four individual bases 
for incorporation. The present methods differ fundamentally 
in providing a large number of potential reactants into the 
reaction mixture. Moreover, a significant number of the 
short gligonucleotides have a sequence appropriate for 
hybridization to the target but, if hybridized, would block or 
prematurely terminate the ligation process. 

As seen in FIG. 1, oligomer P3 is complementary to a 
portion of the target sequence, but, if hybridized, would 
blodc ligation of PI and P2 to the primer. Surprisingly, the 
presence of complementary oligomers which can not be 
ligated onto the primer does not interfere with or prevent the 
successful ligation of the desired oligomers to the template- 
bound primer. 

The library also will contain a majority of oligonucleotide 
sequences which are not complementary to the target or only 
partially complementary. This excess of oligonucleotides, in 
effect, competes with the correct sequences for recognition 
and ligation. Nevertheless, ligation of short oligonucleotides 
in the correct order does occur effectively in spite of the 
statistical unlikelihood. The ability to faithfully replicate a 
nucleic acid by successive ligation of many short oligo- 
nucleotides in one step is unexpected and greatly simplifies 
the process compared to others known in the art. 

The length of oligonucleotides to use in the present 
methods is governed by the interplay of several competing 
factors. Larger oligomers will hybridize more strongly under 
a given set of conditions (salt concentration, temperature) 
and can therefore hybridize at a higher temperature. As the 
length of the oligonucleotide increases, the number of dis- 
crete compounds required to assemble the complete library 
of all possible n-mers increases by a factor of 4 for each unit 
mcrease of n. 



Length of Oligomer 


Total # of Seqences 


1 


4 


2 


16 


3 


64 


4 


256 


5 


1024 


6 


4096 


7 


16,384 


8 


65,536 


9 


262,144 


10 


1,048,576 



Shorter oligomers require less compounds to construct the 
entire library, but become more difficult, e.g. lower 
temperature, to hybridize and ligate as their length 
decreases. This, in tum, translates to greater stringency at a 
given temperature. Still another factor is the ability of the 
oligonucleotide to hybridize and initiate extension at a site 
not associated with the primer. Primer-independent hybrid- 
ization has been demonstrated to occur, under the right 
conditions, with oligonucleotides as small as 6 bases. Liga- 
tion of 2 or more contiguous hexamers to produce e.g., a 
dodecamer or octadecamer, then effectively produces a new 
primer. If this happens, the ability to control the starting 
point for polynucleotide synthesis is compromised. On the 
other hand, the probability of finding multiple occurrences 
of a given sequence in a nucleic acid of hundreds of bases 
increases substantially as shorter oligonucleotides are used. 
In applications involving sequence determination, it is desir- 
able to avoid or minimize the occurrence of duplicate 
sequence elements. The selection of the optimum length 
oligonucleotide to use is a compromise among these con- 
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Aiding effects. The optimum lengtii will be different in extension and dehybridization results in a two-fold ampli- 

different end uses. fication of the target sequence. Since heating is generally 

In practice it may not be necessary to use the full library required for separating the newly synthesized duplex nucleic 

of oligonucleotides of length n. When the number of oligo- acid, additional ligase may need to be added in subsequent 

nucleotides required to produce the given sequence is small 5 rounds of ligation-extension. Alternatively, the process can 

compared to the total number of oligonucleotides in the be performed with a thermostable ligase. Thermal cycling 

library, partial libraries can be used and still maintain a high can then be performed without replacing the ligase every 

probability that all of the required oligonucleotides will be cycle. 

present. In some instances it may be desirable to exclude In accordance with the above description there is provided 

certain sequence oligonucleotides which hybridize too lO a method for amplifying the amount of a portion of a double 

weakly or strongly. stranded nucleic add having a first strand and a second 

It is not necessary in the present methods, except as strand comprising: 
explicitly noted below, that each component of the set of a) providing a first primer which is complementary to a 
oligonucleotide 5 -phosphates used in a given method be of region of the first strand and a second primer which is 

the same number of bases. It can be advantageous in some 15 complementary to a region of the second strand 
embodiments to use a combination of oligomers of two or wherein the first and second regions define the portion 

more different lengths, such as pentamers and hexamers, in of the double stranded nucleic acid to be amplified; 

order to avoid the occurrence of duplicate oUgomers. providing a plurality of oligonucleotide 

In another embodmient, template-directed ligation of a 5'-mononhosphates- 
plurality of a set of short oligonucleotides of the same length 20 . c ! ^ j*jc.ujui 

^ , ' i_ _f J • u- u *- 1 c) separating the first and second strands of the double 

onto a primer can be performed m a manner which controls * j j i • • j 

. J . , p , f. . c ^ J ui stranded nucleic acid: 

the endpomt of the ligation by the use of nonextendable ... n . -ii. 

oligoraers. A nonextendable oligomer can contain the same ^) hybndizmg the first and second pruners with the 
or a different number of bases as the other oligomers in the separated strands; 

set. The nonextendable oligomer contains a 5'-phosphate so 25 e) ligating onto the hybridized first and second primers in 
that it can be Hgated, but lacks the 3'-0H group. It could, for sequence at least some of the plurality of oligonucle- 

example, have a dideoxy base at the 3'-end of the oligomer otide 5'-monophosphates to extend the double stranded 

so that there is no 3'-OH for ligation. Another type of region and thereby synthesize a nucleic acid strand 

nonextendable oligomer contains a blocked 3'-0H group for which is complementary to the portion of the template; 

example where the hydroxyl group is blocked with a methyl 30 and 

group or a phosphate group, to prevent subsequent ligation. f) repeating steps c-e as many times as desired to increase 
Modifications to the terminal base which prevent ligation are the amount of die amplified portion of double stranded 

another possible type of nonextendable oligomer. The non- nucleic acid. 

extendable oligomer can be labeled or unlabeled, depending In a preferred embodiment of an amplification process, 

on the need. A preferred embodiment is to use oligomers 35 the set of oligomers is preselected to contain only those 

containing a dideoxy base at the 3'-terminus. oligomers necessary to replicate the two strands, i.e. those 

Another aspect of the present invention is a method for oligomers occurring on the two strands in the region 

synthesizing a nucleic acid by ligation of a plurality of spanned by the two primers. In another preferred 

oHgonucleotide 5'-phosphates onto a template-bound primer embodiment, nonextendable oligomers are used for the 

in both the 5'^3' and 3'-^5' direction at the same time. This 40 terminal positions of each strand. These two terminating 

process can be performed, for example, by providing a oligomers, by definition, have a base sequence complemen- 

5'-phosphate group on the primer. Ligation can occur simul- tary to the first group of bases of the length of the oligomer 

taneously from both termini of the primer as long as the at the 5' end of each primer. 

appropriate ligatable oligomers are provided. The point of Amplification methods in accordance with the present 

termination of synthesis in either or both directions can be 45 invention can be achieved by synthesis of each strand in both 

controlled by the use of nonextendable oligomers or by the S'-^-S' and y->5' direction at the same time. This 

excluding selected oligomers. A nonextendable oHgomer for bidirectional amplification process can be performed, for 

terminating synthesis in the y^5' direction would not have example, by providing a 5'-phosphate group on the primer, 

tiie 5'-phosphate group. Ligation can occur simultaneously from both termini of each 

The template-directed ligation of a plurality of a set of so primer as long as the appropriate ligatable oligomers are 

short oligonucleotides onto a primer can be used in a method provided. The point of termination of synthesis in either or 

of amplifying the quantity of a target DNA. Accordingly, both directions can be controlled by the use of nonextend- 

another aspect of the invention comprises a method of able oligomers or by excluding selected oligomers as 

amplifying a target nucleic acid using a ligase, two primers described above. 

and a set of short oligonucleotides where the probes are 55 As is the case with other uses of the present oligomer 

complementary to regions on opposing strands spanning the ligation method of synthesizing nucleic acid, either labeled 

region of the target to be amplified. At a minimum, the or unlabeled oligomers can be used. The set of oligomers 

oligomer set supplied for reaction must contain those oli- used can be the entire library, a substantial portion of the 

gomers required to extend both primers on their respective library or a preseleaed subset if the sequence to be amplified 

strands as far as the position corresponding to the 5' end of 60 is known in advance. 

the other primer. Additional oligomers can be included, for In another aspect, the method of synthesizing specific 

example as would occur when using the entire library of nucleic acid sequences by figating oligomers onto target 

oligomers instead of preselecting the set of oligomers. The bound primers can be used in diagnostic applications. Spe- 

process, shown schematically in FIG. 2, is distinct from the cific sequences characteristic of the target of interest can be 

polymerase chain reaction, PGR, but using a fibrary of 65 detected using labeled oligomers in the method of synthe- 

oligomers and a ligase instead of the four deoxyribonucle- sizmg the new strand. When the base sequence of the target 

otides and a polymerase. Each cycle of annealing, ligase nucleic acid region is known, the corresponding oligomers 
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needed to complete this sequence are used, at least some of 
which should carry a detectable label. Such methods have 
use in many areas of nucleic acid diagnostics, including 
detection of infectious agents such as C trachomatis and JV. 
gonorrhoeae, R carinii, M. tuberculosis, detection of food 
borne pathogens such as Salmonella and E. coli, meUiods of 
detecting the expression of genes in high throi^hput screen- 
ing assays, methods of detecting genetic abnormalities, 
forensic testing of DNA samples from suspected criminals, 
identity matcMng of human remains and paternity testing. 

In the area of genetic abnormality testing, one application 
is a method for the detection of genetic mutations. The 
mutations can be a point mutation (a and p-Thalassemia), a 
single base substitution (Sickle Cell Anemia), a deletion 
(Cystic Fibrosis AFjog? Tay-Sachs), an insertion, a 
duplication, a transposition of bases or a combination of the 
above. Labeled oligomers are selected for ligation to a 
probe/primer such that the resulting extended primer is a 
labeled mutation-specific polynucleotide. 

The methods of the present invention can be used to 
provide a method for the differentiation of heterozygotes 
from homozygoles for such a genetic condition. Since two 
copies of a chromosome containing a DNA sequence of 
interest are present in a sample, the method of synthesizing 
labeled complementary DNA provides a means for distin- 
guishing heterozygotes from either homozygote. A set of 
oligomers carrying a first label which, when ligated, produce 
a portion of a strand complementary to the normal sequence 
is provided for ligation. Another set of oligomers carrying a 
second label produces a portion of a strand complementary 
to the mutant sequence upon ligation (FIG. 4). Ligating the 
sets of oligomers to a probe hybridized to target DNA in the 
sample creates a polynucleotide complementary to the 
sample genotype. The three genotypes are resolved by the 
determining which labels are present in the newly synthe- 
sized DNA Homozygous DNA wiU contain one label or the 
other, heterozygous DNA will contain both. 

When the sequence to be synthesized is not known, a 
library of a large number of the total possible pool of 
oligomers is used. The latter situation occurs in sequence 
analysis and mutation screening. When used in conjunction 
with the methods of ascertaining the base sequence of a 
newly synthesized polynucleotide described in detail below, 
numerous mutations of a particular gene can be analyzed 
and identified simultaneously. The ability to test for multiple 
mutations in a gene would enable screening for genetic 
diseases such as cystic fibrosis for which more than 500 
mutations have been identified. 

In yet another aspect, there is provided a method of 
synthesizing an immobilized single stranded nucleic acid 
having a region whose base sequence is complementary to 
a portion of the base sequence of a test nucleic acid 
comprising: 

a) providing a capture probe/primer which is complemen- 
tary to a portion of the test single stranded nucleic acid; 

b) contacting the capture probe/primer with the test single 
stranded nucleic acid under hybridizing conditions to 
capture the test single stranded nucleic acid and form a 
captured probe -test nucleic acid hybrid having a single 
stranded region and a double stranded region; 

c) contacting the captured hybrid with a plurality of 
oligonucleotide 5'-monophosphates; 

d) hgating at least some of the plurality of oligonucleotide 
5*-monophosphates to the capture probe/primer to 
extend the double stranded region; 

e) removing the oligonucleotide 5'-monophosphates 
which are not ligated; and 



45 



50 



f) denaturing the captured probe-test nucleic acid hybrid 
having an extended double stranded region to remove 
the test nucleic acid from the solid support and produce 
the immobilized single stranded nucleic acid. 

The method controls the point of origin and is not limited 
by the size of oligomers to be Ligated. When the sequence to 
be transcribed is exactly known, the oligomers can be 
pre-selected to reduce cost and complexity. 

Another aspect of the invention is a method of synthe- 
sizing multiply labeled nucleic acid where the extent of label 
incorporation is controlled and provides a high density of 
labeling. When using an immobilized primer to serve the 
dual purpose of capture probe and primer the process 
comprises: a method of synthesizing an immobilized mul- 
tiply labeled single stranded nucleic acid comprising: 

a) providing a capture probe/primer which is complemen- 
tary to a portion of the test single stranded nucleic acid; 

b) contacting the capture probe/primer with the test single 
stranded nucleic acid under hybridizing conditions to 
capture the test single stranded nucleic acid and form a 
captured probe-test nucleic acid hybrid having a single 
stranded region and a double stranded region; 

c) contacting the captured hybrid with a plurality of 
labeled oligonucleotide 5' -monophosphates; 

d) ligating at least some of the plurality of labeled 
oligonucleotide 5'-monophosphates to the capture 
probe/primer to form a captured probe-test nucleic add 
hybrid having an extended double stranded region; 

e) removing the labeled oligonucleotide 
5'-monophosphates which are not ligated; and 

f) denaturing the captured probe-test nucleic acid hybrid 
having an extended double stranded region to remove 
the test nucleic add from the solid support and produce 
an immobilized labeled single stranded nucleic acid 
containmg a plurality of labels. 

The primer can alternatively be a nonimmobilized primer 
for the purposes of synthesizing a multiply labeled nucleic 
add. This embodiment comprises: 

a) providing a primer which is complementary to a 
portion of a test single stranded nucleic add; 

b) contacting the primer with the test single stranded 
nucleic add under hybridizing conditions to form a 
primer-test nucleic acid hybrid having a single stranded 
region and a double stranded region; 

c) contacting the hybrid with a plurality of labeled oligo- 
nucleotide S'-monophosphates; 

d) ligating at least some of the plurality of labeled 
oligonucleotide 5'-monaphosphates to the primer to 
form an extended primer-test nucleic add hybrid hav- 
ing an extended double stranded regior^ and 

e) removing the labeled oligonucleotide 
5'-monophosphates which are not ligated. 

The method can further comprise the step of separating 
the extended primei strand from the template nucleic add 
strand if desired. The label borne on each oligonucleotide 
5'-monophosphate can be different or all can be the same 
label. Alternatively, a limited number of different labels, e.g. 
2-5 labels, can be employed. The choice of labels used will 
be governed by the final application. 

The present methods, in contrast to other methods of 
labeling nucleic acids described in the Background section, 
can prepare virtually any length nucleic acid, but would 
probably be most useful for products of at least about 50 
bases. Shorter products would have less labels attached. One 
of the main advantages is that the degree and position of 
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label attachment is precisely controlled. For example, pen- 
tamers bearing one label each lead to product in which every 
fifth base is labeled, providing a label density of 20%. Still 
higher densities can be achieved with shorter oligomers or 
with pentamers bearing two or more labels each. 

The ability to controUably label at these high densities 
will be particularly advantageous in diagaostic tests where 
detection sensitivity is paramount. Higher label densities 
should translate to improved limits of detection. Controlled 
labeling will contribute to improved assay precision. The 
labels can be virtually detectable species, including 
radioisotopes, chemiluminescent labels and fluorescent 
labels, calorimetric labels detected on the basis of absorption 
of light, specific binding molecules including antigens and 
antibodies, binding proteins such as streptavidin and haptens 15 
such as biotin and digoxigenin. In addition, when the label 
is a smaU hapten, the detectable label can be a species such 
as an enzyme which is bound to the nucleic acid via an 
enzyme-anti-hapten conjugate. In the latter regard, the use of 
peatamer ligation to produce labeled nucleic acid provides 20 
still another advantage. The bulky enzyme labels would be 
attached at every fifth base, which places them at nearly 
ISO** angles along the double helbc from the nearest neigh- 
boring label Consideration of the intemucleotide separation 
and molecular diameters of enzymes, reveals that even 25 
relatively large globular proteins can be accommodated at 
this labeling density without severe steric congestion. 

Still higher label densities can be achieved by adopting 
the branching label principle in conjunction with the incor- 
poration of regular labeled oligomers as depicted in FIGS. 5 30 
and 6. In practice, some or all of the oUgomers would 
constitute a "handle" such as a hapten or short recognition 
sequence which is uised to bind to a branched amplification 
multimer. 

Alternately, the arms of the branches could be prepared by 35 
the ligation of labeled short oligomers, so that each of the 
multiple arms carries detectable labels. Synthesis of densely 
labeled nucleic acids by ligation of labeled oligomers can be 
adapted to other types of branching DNA technology such as 
the DNA dendrimers (Polyprobe, Philadelphia). 40 

The oligonucleotide 5'-phosphates used in the above- 
disclosed methods of synthesis, amplification, preparing 
labeled polynucleotides or immobilized polynucleotides are 
preferably relatively short. In these applications, it is not 
necessary to use a substantial fraction of the total library of 45 
oligomers of a given length in order to be able to synthesize 
the desired nucleic acid of known sequence. The size of the 
ohgomers can take any convenient value, typically from 2 to 
about 20 bases. When high density labeling is desired, it is 
preferred that the oligomers coatain less than about 10 bases 
and preferably from about 4 to about 8 bases. 

In another aspect of the invention, methods are provided 
for determining the sequence (sequencing) of an unknown 
single stranded nucleic acid. The method can be applied to 



is to provide a biotin label for binding to a streptavidin- 
coated support. Streptavidin-coated beads and microtiter 
plates are commercially available. 

The oligonucleotide 5'-phosphates used in sequence 
5 analysis determinations performed in accordance with the 
methods disclosed herein are preferably relatively short. It is 
necessary to use a substantial fraction of all possible oligo- 
mers of a given length in order to be able to synthesize long 
stretches of nucleic acid of unknown sequence. In order to 
10 keep the total library size manageable, it is desirable to limit 
the size of the oligomer to less than about 8 bases. It is more 
preferred that the oligomers contaia 5 or 6 bases. A further 
requirement in embodiments involving sequence analysis is 
that all oligonucleotide 5'-phosphates be of the same number 
15 of bases. 

In this aspect of the invention, a method for determining 
the sequence of a portion of a single stranded nucleic acid 
comprises the steps of: 

a) providing a capture probe/primer which is complemen- 
tary to a portion of the single stranded nucleic acid; 

b) hybridizing the capture probe/primer with the single 
stranded nucleic acid to capture the single stranded 
nucleic add and form a capmred probe-nucleic acid 
hybrid having a single stranded region and a double 
stranded region; 

c) contacting the captured hybrid with a plurahty of 
labeled oligonucleotide 5'-monophosphates of the same 
number of bases each oligonucleotide 
5'-monophosphate having a unique label; 

d) ligating at least some of the pluraHty of labeled 
oligonucleotide 5'-monophosphates to the capture 
probe/primer to form a captured probe-nucleic acid 
hybrid having an extended double stranded region; 

e) removing the labeled oligonucleotide 
5'-monophosphates which are not ligated; 

f) denaturing the captured probe-nucleic acid hybrid hav- 
ing an extended double stranded region to remove the 
nucleic acid from the sotid support and produce an 
immobilized complementary single stranded nucleic 
acid containing a plurality of labels and a region whose 
sequence is complementary to a region of the nucleic 
acid; 

g) detecting the plurality of labels; 

h) relating the plurality of detected labels to the identity 
of their corresponding oligonucleotide 
5'-monophosphates; and 

i) determining the base sequence of tiie portion of the 
nucleic acid from the identity of the plurality of oligo- 
nucleotide 5*-monopho6phates. 

The process of converting the collection of partial base 
sequences derived from the plurality of deteaed labels 
involves performing a set of analyses to relate the collection 
of partial base sequences to their correct relative order or 



50 



RNA, ssDNA and denatured dsDNA sequences of suitable 55 position in the total sequence to be determined. A subset of 



lengths provided that at least a portion of the sequence is 
known. The latter restriction is necessary in order that a 
capture probe/primer may be designed. 

The capture probe/primer is immobilized or capable of 
being immobilized onto a solid support such as a bead, tube, 
filter, membrane microtiter plate or chip. The capture probe 
should be of sufficient base length to guarantee efGcient 
hybridization and represent a unique partial sequence on the 
test nucleic acid. These conditions will generally be satisfied 



partial base sequences is identified in the initial ligation 
experiment. The order of occurrence of each partial 
sequence in the full sequence is then deduced from a set of 
experiments in which one oligomer representing one partial 
sequence is excluded from the set of all identified partial 
sequences. For a nucleic acid sequence of N bases, the 
number of identified partial sequences of n bases would be 
N/n, assuming no duplicates. The number of such sets, each 
containing (N/n)-l oligomers and lacking a different 



with a length of at least 10 bases and preferably at least 15 65 oHgomer, equals the number of partial sequences, N/n. The 
bases. The capture probe can be immobilized onto the solid ligation reaction of these sets to the hybridized primer 
support in any art-recognized way. A commonly used means produces a collection of extended primers of different 
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lengths, ranging from 0 to N/n additional sets of n bases plus 
the length of the primer, i.e. primer+n, +2n, +3n, etc. Since 
the identity (sequence) of the excluded oligomer is known 
for each experiment, its relative position in the total 
sequence is given by the formula: 1+number of additional 
n-base units which were incorporated in that experiment. 

There are several ways in which the reaction product of 
each of the aforementioned N/n experiments can be identi- 
fied. Each method constitutes a different embodiment of the 
invention. In one embodiment, unique cleavable labels are 
provided on each oligpmer. Hie plurality of unique labels is 
cleaved from the extended probe/primer to produce a set of 
label fragments, each having a imique molecular mass. This 
method of sequence analysis is depicted schematically in 
FIG. 7. The set of label fragments is analyzed by introduc- 
tion into a mass spectrometer. In a preferred mode, the mass 
analysis is performed under conditions where the parent ion 
of each label fragment can be detected. The experimental 
output of each experiment consists of a set of molecular 
masses, each set containing a different number of values. 
The collection of sets of molecular masses is compared to 
determine the relative position of each unique label and its 
associated partial base sequence in the total sequence being 
determined. This analysis is most conveniently done by a 
computer algorithm. 

The cleavable labels can be any molecular fragment 
capable of being controllably released from the extended 
probe/primer. Preferred labels are small organic molecules 
of molecular mass less than about 50,000 amu. It is desirable 
that the labels all be of one structural type, having a common 
functional group so that all are cleavable by a common 
means. One means for effecting cleavage is by thermolysis 
of a thermally labile group. A preferred thermally labile 
group for use in cleavable labels is a 1,2-dioxetane. It is well 
known that 1,2-dioxetanes undergo a thermal fragmentation 
of the dioxetane ring to produce two carbonyl fragments. 
Dioxetane-labeled oligomers can be prepared which release 
a carbonyl compound when heated by tediering a dioxetane 
group to a ribonucleotide or deoxyribonucleotide. 

o — o o o 
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reductive cleavage of a disulfide or peroxide, oxidative 
cleavage of alkenes or diols. 

An exemplary enzymatic reaction for label cleavage uti- 
lizes enzymaticaliy triggerable dioxetanes as labels. Enzy- 
matic deprotection of a protected phenolic substituent trig- 
gers cleavage of the dioxetane ring into two carbonyl 
compounds as depicted above. The reaction can be per- 
formed at room temperature and the rate of cleavage con- 
trolled by the amount and nature of the triggering enzyme 
and the characteristics of the reaction solution, e.g. pH. 
Numerous triggerable dioxetane structures are weU known 
in the art and have been the subject of numerous patents. The 
spiroadamantyl-stabilized dioxetanes disclosed in U.S. Pat. 
No. 5,707,559 are one example, others containing alkyl or 
cycloaikyl substituents as disclosed in U.S. Pat. No. 5,578, 
253 would also be suitable. A linking substituent from the 
aforementioned spiroadamantyl, alkyl or cycloaikyl groups 
would be required to attach the dioxetane label to the 
oligomer. Linkable dioxetanes are disclosed in U.S. Pat. No. 
5,770,743. 




A library of oligomers would comprise the set of all 
possible sequences of n bases, each covalently attached to a 
unique dioxetane moiety. For convenience of synthesis, the 

linking functionality connecting the dioxetane ring group to 50 of oligomers identified in the preliminary analysis to relate 



Chemical methods of cleaving triggerable dioxetanes are 
ako well known and would be similarly useful in the 
methods of the invention. In the example above, X can be a 
trialkylsilyl group and the triggering agent fluoride. Other 
triggering agent/cleavable group pairs are described in, for 
example, the aforementioned 5,707,559, 5,578,253 and 
5,770,743 patents. 

The foregoing method comprised the steps of: 

1) performing an initial ligation experiment with labeled 
oligomers, 

2) releasing the labels, 3) detecting the labels, 4) deter- 
mining the set of partial base sequences associated with the 
labels, 5) performing a set of ligation reactions with a subset 



the oligomer should be common to all members of the 
family of labeled oligomers. Substituents on the dioxetane 
ring at the carbon which is cleaved will vary among the 
members of the set of compounds. 

Other thermally cleavable functional groups such as non- 
cyclic peroxides are known and can be used. The tempera- 
ture required for thermolysis must be low enough so that 
oligonucleotide fragmentation does not occur. 

The means of cleaving the cleavable label is not limited 
to thermal cleavage. Any means of controllably releasing the 
label from the extended probe/primer can be employed. 
Other means include, without limitation, enzymatic 
reactions, chemical reactions including nucleophilic dis- 
placements such as fluoride-induced siiyl ether cleavage, 
basic or acidic hydrolytic fragmentations such as ester 
hydrolysis or vinyl ether hydrolysis, photochemical 
fragmentations, reductive cleavage such as metal-induced 



the collection of partial base sequences to their correct 
relative order or position in the total sequence. Alternatively, 
the preliminary ligation and analysis can be omitted and the 
sequence can be determined by performing sets of ligation 
55 reactions, excluding one oligomer in each set. In this mode, 
the sets would need to comprise the whole library of partial 
sequences less the excluded one. Detection m6/ox quanti- 
tation of the labels is then performed in the same manner as 
described above. 
60 In the foregoing methods where arrays are prepared 
lacking one oligomer from the set of oligomers, an alterna- 
tive approach would be to incorporate nonextendable oli- 
gomers for the particular oligomer which would otherwise 
be excluded. The nonextendable oligomer can be labeled or 
65 unlabeled, depending on the need. Such nonextendable 
oligomers could have a dideoxy base at the 3 -end of the 
oligomer so that there is no 3'-0H for ligation. The 3'-0H 
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could be blocked, for example with a methyl group or a mined by measuring the action of the enzyme on a 

phosphate group, to prevent subsequent ligation. Modifica- fluorogenic, chromogenic or chemiluminogenic substrate, 

tions to the terminal base which prevent ligation are another The quantitative detection techniques described above 

possibility. rely on the ability to discriminate signal from 0 to N/n with 

In another aspect, the method of Hgating oHgomers onto 5 ^ait resolution where N is the total number of bases to be 

a template-bound primer for the purpose of sequence analy- sequenced and n is the number of bases in the oHgomer 

sis can be performed using a single label with quantitative phosphates i^ed. The resolution demand of the detection 

analysis. The process of determining the sequence can be ^^.''^ ^^f^^ by performing m prallel sete of 

achieved by performing a set of ligation reactions each ^^^^^^ °^y/ predetenmned fraction(l/m) of the 

^ T, f T c , _ ohgomers are labeled. If m sets of experiments arc then 

reaction contaming the full ibrary of ohgonjers of n bases 10 f^^^^^ ^ ^^^^ ^ ^^^^^^^ .^^ ^^^^^ 

less one as described generally above. Each ohgomer carries ^ ^^^^^^^ ^ ^^^^ ^^^^ unlabeled, ligation of the 

the same detectable label. Each hgation reaction produces an j^^^ detection produces a set of values in the range 0 

extended primer of a length, ranging from 0 to N/n addi- ^ ^^^^ experiments. The sum of the 

tional sets of n bases plus the length of the primer, i.e. information in the m sets combines to produce the same 

primer+n, +2n, +3n, etc. The quantity of detectable label in 15 information (total sequence). This reduces the measurement 

each reaction is proportional to the number of ligated precision requirement and provides m-fold redundancy of 

oligomers. Collectively the set of extended primers produces results. As an example using pentameric oligomers, pentam- 

all values from 0 to N/n with the maximum value resulting ers arbitrarily designated 1-205 would be labeled and the 

in all reactions in which the excluded oligomer is not present rest unlabeled in the first set. Numbers 206-410 would be 

in the template sequence. The value of 0 occurs when the 20 labeled in a second set, numbers 411-615 in a third, 

excluded ohgomer represents the first five bases in the 616-820 in a fourth and 821-1024 in a fifth set. Each of the 

template sequence. Since the identity (sequence) of the five sets of ligations will produce data with numeric values 

excluded ohgomer is known for each experiment, its relative firom 0 to N/5 m. The individual reactions responsible for 

position in the total sequence is given by the formula: producing these values will differ among the five sets. 

1+number of additional n-base units which were mcorpo- 25 In still a further embodiment, unlabeled ohgomers can be 

rated in that experiment. The template sequence is then used in a method for sequencing by ligation when applied to 

deduced from the order of occurrence of each partial polynucleotides of up to a few hundred bases. Methods of 

sequence (oligomer). DNA sequence analysis using MALDI-TOF mass spectrom- 

There are several ways in which the label can be detected. etry have been developed to accurately determine the 

Each method or type of label constitutes a different embodi- 30 molecular mass of a series of polynucleotides differing in 

ment of the mvention. In one embodiment, the label is a length by one base generated by exonuclease digestion of a 

fluorescent molecule such as the fluorescers FAM, JOE, nucleic acid. The technique is easily capable of discriminat- 

ROX and TAMRA commonly used in automated dideoxy ing polynucleotides differing in length by 5 bases on the 

sequencing. Numerous methods of labeling nucleotides and basis of molecular mass. Current technology can accurately 

oHgonucleotides are known in the art and include direct 35 identify polynucleotides up to about 80-100 bases with 

attachment of label (Haugland, Handbook of Fluorescent adequate (single base) mass resolution. A series of ligated 

Probes and Research Chemicals, (Molecular Probes, polynucleotide products formed in accordance with the 

Eugene, Ore.), 1992). Labeling can also be accomplished by methods of the present invention containing from 0 to about 

indirect means where, for example, where a universal Hnker 100 l%ated short oligonucleotides, such as pentamers, would 

such as biotin is provided as the primary label and a 40 require no better instrumental resolution and would extend 

fluorescer-labeled binding partner for biotin provides the the mass range which could be sequenced several fold, 

label. Another aspect of the invention comprises a method of 

In another embodiment, the label is a chemiluminescent detecting a target nucleic acid by detecting a labeled 

compound and the quantity of label is detected by the light extended nucleic acid which is complementary to the target, 

intensity produced by triggering the generation of chemilu- 45 the method being a simpler alternative than traditional 

minescence from the label. Several types of chemilumines- Southern and northern blotting. Preparation of the labeled 

cent compounds are known and can be used as labels. extended complementary nucleic acid is performed by Hga- 

Representative examples include acridiniiim esters and tion of a pluraUty of labeled short oHgomers onto a probe/ 

sulfonamides, luminol or isoluminol derivatives, and diox- primer which is hybridized to the target. Extension is 

etanes(R. Handley, H.Akhavan-Tafti,A. P. Schaap,7. C/in. 50 followed by denaturing electrophoretic separation and 

Ligand Assay, 20(4) 302-312 (1997)). A preferred chemi- detection of the labeled species. The presence of the labeled 

luminescent label is an acridan phosphate compound as extended primer is indicative of the presence of the target 

disclosed in ^plicant's co-pending appHcation Ser. No. sinceligationonly takes place when the primer is hybridized 

09/099,656. The latter compounds are used advantageously to the target. It is preferred that the label is detectable in the 

because of their stabihty, high chemiluminescence quantum 55 gel. Suitable labels include acridan alkenes as described in 

efficiency, ease of conjugation and ability to be triggered U.S. Patent application Ser. No. 09/099,656 filed on Jun. 17, 

under a wide range of conditions, including in electrophore- 1998, which can be detected by chemiltmiinescence, and 

sis gels. Bioluminescent and electrochemiluminescent com- fluorescers which are readily detectable in gels. In this 

pounds are considered within the scope of detectable chemi- embodiment, no blottii^ is performed. If the label is such 

luminescent labels. 60 that detection in the gel is not feasible, then blotting onto 

In another embodiment, the label is a chromogenic com- membrane is performed and then detection of the label is 

pound and the quantity of label is detected by Ught absor- performed on the membrane. In no case is hybridization on 

bance. Another label type is a radioisotope such as ^^P and the membrane, antibody binding, enzyme-conjugate 

^^S whose presence can be detected using scintillation binding, substrate addition or other commonly used methods 

counting or x-ray imaging. The label can also be an enzyme 65 necessary. 

such as alkaline phosphatase, P-galactosidase, luciferase and In an alternate embodiment of this method, the labeled 

horseradish peroxidase. The quantity of enzyme is deter- extended complementary nucleic add remains hybridized to 
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the target and after electrophoresis, the band detected at the 
appropriate molecular weight in the manner described 
above. This mode may be desirable when accurate molecular 
weight information is needed. In this method it is more 
convenient to provide substantially the full library of pos- 
sible oligomer 5'-phosphates of n bases when the target is 
much longer than the probe/primer. In cases where the target 
sequence is known and where its length makes it more 
practical, it may be preferred to preselect the subset of 
oligonucleotide 5 -phosphates. 

Yet another embodiment comprises providing a suitable 
fluorescent donor as a label on a probe/primer and a suitable 
fluorescent acceptor as a label on the oligomers. It is not 
necessary to label each oligomer. Ligation is performed on 
hybridized primer to form an extended primer bearing a 
fluorescent donor and one or more fluorescent acceptor 
labels. Under suitable conditions, i.e. when the douor and 
acceptor possess sufficient spectral overlap for energy trans- 
fer to be feasible and the spatial separation between donor 
and acceptors are within the Forster distance, energy transfer 
between fiuorescers can occur within the extended primer. 
Inadiation of the extended primer at a wavelength absorbed 
by the fluorescent donor on the primer results in fluores- 
cence from the acceptor on the extended portion. This 
method can therefore serve as the basis for a homogeneous 
assay for detecting a target nucleic acid since the presence 
of target is required to permit the Hgation to occur and 
thereby bring the fluorophores within energy transfer dis- 
tance. 

Another method for detecting a target nucleic acid based 
on the ligation of a plurality of labeled oligomers comprises 
using a fluorescent intercalating dye as a label. It is known 
that certain dyes become fluorescent when intercalated 
within the double helix of double stranded nucleic adds. An 
example is the widely used compoimd ethidium bromide. 
Accordingly, a method for detecting a target nucleic add 
comprises: 

a) providing plurality of oligonucleotide 5'-phosphates 
wherein at least some contain a fluorescent intercalat- 
ing dye as a label; 

b) providing an oligonucleotide primer which is comple- 
mentary to a portion of a target nucleic add; 

c) contacting the primer with the target nucleic add under 
hybridizing conditions to form a primer-target duplex 
having a single stranded region and a double stranded 
region; 

c) contacting the duplex with the plurality of oligonucle- 
otide 5*-monophosphates; 

d) ligating at least some of the plurality of oligonucleotide 
5 -monophosphates to the duplex to extend the double 
stranded region; 

e) detecting fluorescence from the intercalated bound 
label. 

As an optional step, agarose can be added to the reaction 
to enhance fluorescence. In the absence of target, Hgation 
does not occur, so the detection of fluorescence is evidence 
of the presence of the target and additionally is evidence that 
the primer was sufficiently complementary to the target to 
hybridize. The collection of oligomers can be the full library 
of all possible sequences, or a subset containing preselected 
members if the target sequence is known. 

The fraction of labeled oligomers to use can be selected 
empirically with regard to the desired degree of detection 
sensitivity by using a range of different label densities. It 
may be desirable, depending on the size of the oUgonucle- 
otide 5*-phosphates, to limit the fraction of labeled ohgo- 
mers to avoid self quenching of fluorescence. 



1,614 

18 

Another aspect of the present invention comprises a 
library of short oligonucleotide 5*-phosphates. It is preferred 
that the oligonucleotides consist of 5 bases or less, with 
pentamers being more preferred. The number of pentamers 
5 required for the fiill library is 4^ or 1024 individual com- 
pounds. 

In practice it may not be necessary to include all oligo- 
nucleotide 5-phosphates of length n in forming a library. In 
appHcations where the number of oligonucleotides required 

10 to produce the given sequence of N nucleotides is small 
compared to the total number of oligonucleotides in the 
library (N/n«4"), partial libraries will often sufiSce to 
maintain a high probability of providing aU of the required 
ohgonucleotides. As an illustration of this point, a poly- 

15 nucleotide of 500 bases consists of 100 pentameric units 
The fiill library of all pentamers contains 1024 compounds, 
a more than 10-fold excess. 

It may be desirable to use a partial library which excludes 
selected sequence oligonucleotides which hybridize too 

20 weakly or strongly. In several methods described above for 
sequence analysis of a test nucleic acid, the library will be 
predetermined to consist of a selected number, x, of oligo- 
mers determined in a proceeding step from the identification 
of x+1 labels. In the sequencing method, a collection of 

25 partial libraries of x bases each will be used. Each partial 
library will lack a different one of the x+1 oligomers 
identified on the basis of the preceding step. 

The partial Ubraries can be preformed by preparing all of 
the possible combinations of x oligomers beforehand. 

30 Alternatively, the partial libraries can be assembled as 
needed from the individual ohgomers. The assembly of such 
partial libraries can be accomplished by robotic workstations 
with automatic fluid handling capabilities. 
In general, the library of oligonucleotide 

35 5'-monophosphates will contain labeled oligonucleotide 
5 -monophosphates, in particular those bearing detectable 
labels. In some uses, all of the members of the library will 
bear a detectable label. In other applications, a preselected 
fraction of the members will be labeled. An example is the 

40 method of quantitative analysis disclosed above where, for 
example, five libraries of all possible oligomers are formed, 
each hbiary having a different one-fifth fraction of the 
members being labeled. 
In another embodiment of a library, at least one of the 

45 constituents of a library is a nonextendable oligomer. The 
nonextendable oHgomer can be labeled or unlabeled. Such 
nonextendable oligomers could have a dideoxy base at the 
3-end of the oligomer so that there is no 3'-0H for ligation. 
The 3'-0H could be blocked, for example with a methyl 

50 group or a phosphate group, to prevent subsequent ligation. 
Modifications to the terminal base which prevent ligation are 
another possibflity. 

Synthesis of oligomers — Oligonucleotides are readily 
synfliesized using standard methods of synthesis well known 

55 to those of skill in the art including, e.g., phosphoramidate 
chemistry. Phosphorylation of oligonucleotides is performed 
using a polynucleotide kinase and ATP or by chemical 
methods of phosphorylation as described in (L. A. Slotin, 
Synthesis, 737-752 (1977); T. Horn, M. Urdea, Tetrahedon 

60 Lett., 27, 4705^708 (1986)). A kit is commercially avail- 
able for carrying out 5'-phosphorylation (Phosphate-ON, 
Ciontech, Palo Alto, Calif.). 

Methods for the automated synthesis of oligonucleotides 
are well known in the art and in common commercial use. 

65 A common method uses a solid support of immobilization 
and automated reagent handhng to add nucleotides sequen- 
tially. All addition, blocking and deblocking steps are imder 
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computer control. Such instruments are available from sev- 
eral commercial suppliers such as Applied Biosystems, CA 
(Model 392 and 394). Automated instruments for transfer of 
liquid reagents and samples can be performed imder com- 
puter control using laboratory robots such as are commer- 5 
cially available (Perkin-EbiLer, model 800 Catalyst, Beck- 
man Instruments Biomek). Newer techniques for the high 
speed synthesis or synthesis of large numbers of oligonucle- 
otides utilize photolithographic techniques or ink jet tech- 
nology for the rapid and precise delivery of reagents and lo 
reactants. 

Still a further aspect of the invention comprises annealing 
a primer oligonucleotide 5=-phosphate to a single-stranded 
template and, in the manner disclosed above, ligating a 
library of oligomers to extend the piimer from both ends to 15 
duplicate the template strand. The method is useM in a 
method to render a single stranded template double stranded 
in order to clone it. This would find utility in methods for 
isolating related genes or gene families. 

In an exemplary method, a primer oligonucleotide is 20 
hybridized to a template strand in the presence of a library 
of all possible combinations of pentamers, a DNA ligase, 
and an appropriate reaction buffer. Pentamers that are 
complementary to the template strand, and in exact register 
with the 5' and 3* ends of the primer oligonucleotide, anneal 25 
and are sequentially ligated by the action of the DNAligase. 
The template strand thereby becomes substantially copied or 
rendered double-stranded. This procedure can be used to 
detect target templates in a mixture of nucleic acid strands 
and to prepare double-stranded nucleic acids for cloning 30 
using cloning vectors and techniques known in the art. 

In order to more fully describe various aspects of the 
present invention, the following examples are presented 
which do not limit the scope of the invention in any way. 

EXAMPLES 

Example 1. General Procediu-e 

The template used in this experiment was a PCR ampli- 
fied product (200 bp) of exon 10 region of the cystic fibrosis 4q 
transmembrane regulator (CFTR) gene. The PCR-amplified 
DNA of the CFTR gene was purified either by running it 
through a column (Qiaquick PCR purification kit, Qiagen, 
Santa Qarita, Calif.) or by ethanol precipitation. The DNA 
was resiispended in distilled water at a concentration of 45 
approximately 0.5 fjglf£L. Pentamers bearing a 5'-phosphate 
group and primers were obtained commercially (Oligos Etc., 
WilsonviUe, Ore.). 

The primer and pentamers were designed to be compli- 
mentary to either iJie sense or the antisense strand of the 
template used. The length of the primer used in these 
experiments ranged from 21 to 26 nucleotides. The pentam- 
ers were designed in such a way that the first pentamer 
anneals to the template immediately adjacent to the 3* end of 
the primer. The subsequent primers line up contiguously 
starting at the 3' end of the first pentamer. Hybridization of 
the primer and pentamers to the template followed by 
ligation by T4 DNA ligase results in back to back ligation at 
the 5-3' junctions. To enable the detection of the ligated 
primer-pentamer products, biotin-dUTP labeled pentamers 
(at the internal dTTP position) were used. 

Hybridization of the primer and pentamers to the template 
and their ligation to each other was accomplished in a 3 -step 
process. First, the temp late -primer-pentamer mix was heated 
lo 94° C. and kept for 5 min to allow the denaturation of the 
double stranded template. The mix was cooled to 60"* C. or 
65** C, depending on the size and base composition of the 
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primer, to anneal the primer to the template for 2 min. 
Finally, the reaction tubes were cooled to 16* C. After about 
2 min at 16° C, ligation buffer (66 mM Tris HCl, pH 7.6, 6.6 
mM Mgds, 10 mM DTT, 66 ATP, Amersham) and T4 
DNAHgase, 1 U (Amersham, 1:10 dilution) were added and 
ligated at 16° C. for 2 hours. The ligation reaction was 
stopped by adding Vioth volume of loading dye (0.01% 
xylene cyanol and 0.01% bromophenol blue, and O.OIM 
EDTA in deionized formamide). 

The ligation reactions were electrophoresed on a dena- 
turing polyacrylamide gel along with biotin-labeled oligo- 
nucleotide size markers. The DNA was capillary transferred 
to a nylon membrane, bound with anti-biotin antibody-HRP 
conjugate, and detected by reacting with Lumigen PS-3 (a 
chemiluminescent HRP substrate) and exposing to an x-ray 
fihn. The size of the ligated product varies depending on the 
number of pentamers ligated to the primer. 

Example 2. Determining Optimal Concentrations of 
Template, Primer and Pentamers. 

Template: A 200 bp PCR product of CFTR exon 10 (See 
the attachment for the template DNA sequence) was 
obtained by PCR amplification using a set of sense (5' 
ACrrCACTTCTAATGArGATTArG 3') (Seq. ID #1) and 
an antisense (5= CrCTTdAGTTGGCATGCTTTGAr 3') 
(Seq. ID #2) primers. 

A 26 base oligonucleotide complementary to the sense 
strand of the template DNA was designed as a primer (5' 
AGTGGAAGAAnTCArrCTGTrCTCA3') (Seq. ID #3). 

Pentamers: Six 5-base long oligonucleotide 5'-phosphates 
complementary to the sense strand immediately adjacent the 
3' end of the primer were prepared. The 5' end of the first 
pentamer aligns immediately next to the 3' end of the primer, 
the 5' end of the second pentamer aligns immediately next 
the 3* end of the first pentamer and so on. To facilitate 
ligation, the 5' end of each pentamer was phosphorylated. To 
enable the detection of the hgation products, pentamers 1 
and 3 were labeled with biotin-dUTP at the central dTTP 
position and the last pentamer was labeled with biotin at the 
3' end. The pentamers were as follows: 
Pentamer 1: 5' PO4-GTTTT 3' 
Pentamer 2: 5' P04-CCU*GG 3' U*=U-Biotin 
Pentamer 3: 5' P04-ArrAr 3' 
Pentamer 4: 5' P04-GCCU*G 3' 
Pentamer 5: 5' PO4-GCACC 3' 
Pentamer 6: 5' PO4-AITAA 3'-Biotin. 

Ligations were performed using T4 DNA ligase and 
hgation buffer (Amersham), according to the ligation oon- 
50 ditions described in Example 1. The ligations were per- 
formed in a volume of 20 f^L. The amount of template was 
kept constant at about 1 jUg per reaction. The amount of 
primer was varied from 100 ng to 1 pg between reactions. 
The amount of each pentamer was varied from 2 ng to 0.2 
55 pg in each reaction. The reaction with Ifigoi template, 100 
ng of primer, and 2 ng of each pentamer contained approxi- 
mately equimolar concentrations of the template, primer and 
pentamers. The primer and pentamers were varied system- 
atically to determine the lowest amount of detectable liga- 
60 tion product. 

The ligation reactions were electrophoresed, capiUary 
transferred to a nylon membrane, bound with anti-biotin 
antibody-HRP conjugate, and detected with Lumigen PS-3 
as described in Example 1. A fall-length primer-pentamer 
65 Hgation product of expected size (56 bp) was detected in the 
ligation reaction containing 100 ng of primer and 20 ng of 
each pentamer (0.6 pM}. Lower concentrations of the primer 
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Pentamer 5: 5' P04-ArTAr 3' Pentamer Sa: 5= PO4- 

TTATG 3' 

Pentamer 6: 5' P04-GCCU*G 3' Pentamer 6a: 5' PO4- 

CCTGG 3' 
Pentamer 7: 5' PO4-GCACC 3' 
Pentamer 8: 5' P04-ArTAA 3'-biotia 

The target region of the template comprises the sequence: 
y TAA TTC GTG TCA CCT TCT TAA AGT AAG ACA 
AGA GTC AAA AGG ACC TAA TAG OCA CCG TGG 
TAATr5'(SeqID#5) 



and pentamers m the ligation reaction yielded very low 
amount or no detectable ligation product under these con- 
ditions. 

Example 3. Ligations with varying number of 
pentamers. 

To show that the pentamers are sequentially ligated to the 
primer starting j5rom the first pentamer (immediately down- 
stream from the primer), ligations were performed using the 
template, primer and pentamers of Example 2 by increment- 
ing the number of pentamers in each reaction. All the 
reactions contained equimolar concentrations (0.6 ^M) of 
the template, primer and each pentamer. The ligations were 
performed as described above and the products detected by 
binding with anti-biotin-HRP antibody and reacting with 
Lumigen PS-3 substrate. 

As expected, the size of the ligation product increased 
incrementally by five bases with the addition of each pen- 
tamer starting from the first pentamer and so on. There was 
no ligation product in the absence of the first pentamer and 
with the rest of the pentamers in the reaction demonstrating 
the requirement of the primer and specificity of the pentam- 
ers for the ligation to occur. In one ligation reaction con- 

taining the first four pentamers, there were two bands of the Evenin thepreseoceof a set of six competing pentamers. the 
hgated product one of which was the expected size and the gg^tion products formed were the result of ligation of the 



Reaction 


Pentamers Used 


Product Length 


1 


1, la-6a 


31 (pruner + 5) 


2 


1, 2, la-6a 


36 


3 


1-3, la-6a 


41 


4 


1-4, la-6a 


46 


5 


1-5, la-6a 


51 


6 


1-6, la-6a 


56 


7 


1-7, la-6a 


61 


8 


1-8, la-6a 


66 


9 


2-8, la 


26 (primer) 


10 


1-^, la-6a (no primer) 


none 



Other was the size expected when all five pentamers are 
present in the reaction. Comparing the sequences of the 
pentamers revealed a single base difference between the 
third and fifth pentamers. The third pentamer appears to 
hybridize at the fifth pentamer position when the fourth 
pentamer is present in the reaction. 
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Reaction 


Pentamers Used 


Product Length 


1 


2, 3, 4, 5, 6 


26 (primer) 


2 


1,2 


36 


3 


1,2,3 


41 


4 


1, 2, 3, 4 


46 


5 


1, 2, 3, 4, 5 


51, 56 


6 


1, 2, 3, 4, 5, 6 


56 



In reaction 1, the product, which consisted of the primer 
alone, was detected by virtue of a label on the primer. 

Example 4. Competition from "Out of Register" 
Pentamer Set. 

This example demonstrates that the ligation of a set of 
pentamers to a primer in a contiguous chain starting from the 
3' end of the primer is not affected by the presence of a 
second set of pentamers which are also complementary to 
the template and also align contiguously, but begin at a 
position *onc-base-out' firom the 3' end of the primer. Both 
the correct pentamer sets (1-8) and one-base-oul pentamer 
sets (la-Sa) were included in ths reaction along with the 



template and primer during the denaturing, annealing at 60° 

C. and ligation steps. The ligation reactions contained 55 Pentamer 1: 5' Phosphate AGGU'^C 



"correct" pentamers being ligated to the primer. Reaction 10 
confirmed that the presence of primer is required for ligation 
to occur. The one-base-out pentamers did not appear to 
interfere with the ligation of the correct pentamers. 

Example 5. Competition firom Labeled "Out of 
Register" Pentamer. 

The experiment as Example 4 but pentamer la was 
biotinylated. This afforded the opportunity to directly 
35 observe the formation of any l^ation products from the set 
of pentamers la-6a. No ligation products were observed 
from the set of one-base-out pentamers. 

Example 6. Ligation Experiments Using JH 
Downstream Template. 

Primer-directed pentamer ligation products were also 
obtained using as the template a 700 bp DNA downstream 
of immunoglobuLlin heavy chain joining region (JH) cloned 
into a plasmid vector. The JH downstream region was 

45 amplified by PCR, cloned into a plasmid vector which was 
then digested with Eco RI to obtain a sofScient amount of 
the template DNA. The restriction digest was separated on 
an agarose gel, and the DNA band of interest extracted using 
a gel extraction kit (Qiagen). The DNA was resuspended in 

50 distilled water at a concentration of approximately 0.5 /Mg/^. 
The primer and pentamers used with this template are 
shown below: 

21mer Primer: 5' GAAACCAGCITCAAGGCACTG 3' 
(Seq. ID #6) 



equimolar (0.6 fM) concentrations of template, primer and 
each of the pentamers. 

Primer (5' ArTAAGCACAGTOGAAGAAnTCATS') (Seq 
ID #4) 

Pentamer 1: 5' P04-TCU*GT 3' Pentamer la: 5' PO4- 
CTGTT 3' 

Pentamer 2: 5' PO4-TCTCA3' Pentamer 2a: 5' PO4-CTCAG 
3' 

PO4-GTTTU* 3' Pentamer 3a: 5' 



Pentamer 3: 5 

Trrrc 3' 

Pentamer 4: 5' 
CTGGA3' 



PO4- 

P04-CCU*GG 3' Pentamer 4a: 5' PO4- 



Pentamer 2: 5' Phosphate CU*GGA3' 
Pentamer 3: 5' Phosphate GCCU*C 3' 
Pentamer 4: 5' Phosphate CCU* AA 3' 
Pentamer 5: 5' Phosphate GCCCC 3'-Biotin 

Ligations were performed with 500 ng of template, 100 ng 
of primer, and 20 ng of each pentamer in each 20 fiL ligation 
reaction. The number of pentamers was incrementally 
increased in each successive Ugation reaction to show that 
\hQ size of the figation product grew in 5 base increments 
with each addition of a pentamer. 

After performing the ligation reactions according to the 
general method of Example 1, there was a 5-base incremen- 



6,001,614 

23 24 

tal increase in the size of the ligation product with each pentamer concentrations were the same as in the previous 

addition of successive pentamers. There were two bands in experiment. Parallel reactions were perfornaed and the rela- 

the ligation reaction containing the first four pentamers, the tive amount of the four pentamer and five pentamer exten- 

upper band being more intense than the lower band. The size sion products assessed. At 30° C. ligation temperature, the 

of the upper band was the same as when all five pentamers 5 correct size and the non-specific ligation products were of 

were used for the ligation. This is probably because there is the same intensity. More of the correct size ligation product 

sequence similarity between the third and fifth pentamers, so was detected at ligation temperatures of 37° C. and 40° C. 

the third pentamer was ligated also at the fifth pentamer At 45° C, the amount of correct size ligation product 

portion. detected was diminished. 

Example 7. Ligation at Various Temperatures. foregoing description and examples are illustrative 

only and not to be considered as restrictive. It is recognized 

Ligations using the template, primer and the first four that modifications of the specific compounds and methods 

pentamers of Example 6 were performed at 30+ C, 3T C, not specifically disclosed can be made without departing 

40** C. and 45° C. to examine the effect of ligation tempera- firom the spirit and scope of the present invention. The scope 

ture on mismatch discrimination. The template, primer and of the invention is limited only by the appended claims. 



SEQUENCE LISTING 

<160> NUMBER 0? SEQ ID HQS: 7 

<210> SEQ ID NO 1 
<211> LENGTH: 24 
<212> TYPE: DNA 
<213> ORGANISM: primer 

<400> SEQUENCE: 1 

acttcacttc taatgatgat tatg 24 

<210> SEQ ID NO 2 
<211> LENGTH: 24 
<212> TYPE: DNA 
<213> ORGANISM: primer 

<400> SEQUENCE: 2 

ctcttctagt tggcatgctt tgat 24 

<210> SEQ ID NO 3 
<211> LENGTH: 26 
<212> TYPE: DNA 
<213> ORGANISM: prijner 

<400> SEQUENCE: 3 

agtggaagaa tttcattctg ttctca 26 

<210> SEQ ID NO 4 
<211> LENGTH: 25 
<212> TYPE: DNA 
<213> ORGANISM: primer 

<400> SEQUENCE: 4 

attaagcaca gtggaagaat -t-bcat 25 

<210> SEQ ID NO 5 

<2ll> LENGTH: 65 

<212> TYPE: DNA 

<213> ORGANISM: ONA template 

<400> SEQUENCE: 5 

ttaatggtgc caggcataat ccaggaaaac tgagaacaga atgaaattct tccactgtgc 60 
ttaat 65 



<210> SEQ ID NO 6 
<211> LENGTH: 21 
<212> TYPE: DNA 



